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Abstract. An indeterminate string (or, more simply, just a string) 
X = a:[l..n] on an alphabet 17 is a sequence of nonempty subsets of E. 
We say that a:[ii] and x[i 2 ] match (written a;[ii] ~ *[* 2 ]) if and only if 
a:[ii]na;[i 2 ] yf 0. A feasible array is an array y = y[l..n] of integers snch 
that y[l] = n and for every i £ 2..n, y\i\ £ 0..n—i+1. A prefix table of a 
string X is an array tt = 7r[l..n] of integers such that, for every i £ 

7r[i] = j if and only if x[i..i+j — l] is the longest substring at position i 
of X that matches a prefix of x. It is known from |CR,SW14] that every 
feasible array is a prefix table of some indetermintate string. A prefix 
graph V — Vy is a labelled simple graph whose strncture is determined 
by a feasible array y. In this paper we show, given a feasible array y, how 
to use Vy to construct a lexicographically least indeterminate string on 
a minimum alphabet whose prefix table tt — y. 


1 Introduction 

In the extensive literature of stringology/combinatorics on words, a “string” 
or “word” has usually been defined as a sequence of individual elements of a 
distinguished set S called an “alphabet”. Nevertheless, going back as far as the 
groundbreaking paper of Fischer & Paterson [FP74j . more general sequences, 
defined instead on subsets of E, have also been considered. The more constrained 
model introduced in [FP74| restricts entries in a string to be either elements of 
E (subsets of size 1) or E itself (subsets of size a = |I7|); th ese have been 
studied in recent years as “strings with don’t cares” [IMM+O,^ . also “strings 
with holes” or “partial words” |BS08j . The unconstrained model, which allows 
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arbitrary nonenipty subsets of S, has also attracted significant attention, often 
because of applications in bioinformatics: such strings have variously been called 
“generalized” |Abr87| . “indeterminate” |HS03j . or “degenerate” [IMROSj . 

In this paper we study strings in their full generality, hence the following 
definitions: 

Definition 1. Suppose a set S of symbols (called the alphabet) is given. A 
string x on S of length n = |a:| is a sequence of n > 0 nonempty finite subsets 
of S, called letters; we represent x as an array x[l..n]. If n = 0, x is called the 
empty string and denoted by e; if for every i S l..n, x[i\ is a subset of S of 
size 1, X is said to be a regular string. 

Definition 2. Suppose we are given two strings x and y and integers i € l--|a;|, 
j G We say that x[i] and y[j] match (written x[i] « y[j]) if and only 

if x[i] n y[j] 0. Then x and y match (x k, y) if and only if |a;| = \y\ and 
x[i\ K. y[i] for every i G 

Note that matching is not necessarily transitive: a « {a, 6} « 6, but a ^ b. 

Definition 3. The prefix table (also prefix array of a string x = a:[l..n] is 
the integer array tzx = T^x\f--n] such that for every i G l..n, r^xii] is the length 
of the longest prefix of x[i..n] that matches a prefix of x. Thus for every prefix 
table TVx, 7ra;[I] = n. When there is no ambiguity, we write tt = ttx. 

The prefix table is an important data structure for strings: it identifies all the 
borders, hence all the periods, of every prefix of x |CRS W 1^ . It was originally 
introduced to facilitate the computation of repetitions in regular strings |ML84| , 
see also |Smy03| ; and for regular strings, prefix table and border array are equiv¬ 
alent, since each can be computed from the other in linear time |BKS13| . For 
general strings, the prefix table can be computed in compressed form in 0{n^) 
time using 0{nf(7) bytes of storage space |SW08| . where a = lifl. Two examples 
follow, adapted from |CRSW14] : 


12345678 

Xx=acagacat (1) 

TTi = 80103010 

1 2 3 4 5 6 7 8 

X 2 = {a, c} {g, t} {a, g} {a, c,g} g c {a, t} a (2) 

772 =8 0 4 2 0311 

Since clearly every position i G 2..n in a prefix table tt must satisfy 0 < 
7r[z] < n — i + 1, the following definition is a natural one: 

An array y — y\\..n] of integers is said to be a feasible array if and 
only if y[l] = n and for every i G 2..n, y[i] G Q..n — i-\-l. 

® We prefer “table” because of the possible confusion with “suffix array”, a completely 
different data structure. 
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An immediate consequence of Definition is the following: 

Lemma 1 r |CRSW14p . Let X = a;[l..n] he a string. An integer array y = 
y[l..n\ is the prefix table of x if and only if for each position i € l..n, the 
following two conditions hold: 

(a) x[l..y[i]] Ki x[i..i + y[i]-l]; 

(b) if i + y[i] < n, then x[y[i] + l] x[i + y[i]]. 

Then the following fundamental result establishes the important connection be¬ 
tween strings and feasible arrays: 

Lemma 2 (' |CRSW14p . Every feasible array is the prefix table of some string. 

In view of this lemma, we say that a feasible array is regular if it is the 
prefix array of a regular string. We are now able to state the goal of this paper 
as follows: for a given feasible array y = y[l..n], not necessarily regular, construct 
a string a; on a minimum alphabet whose prefix table ttx = y — the “reverse 
engineering” problem for the prefix table in its full generality. In fact, we do 
somewhat more: we construct a lexicographically least such string, in a sense to 
be defined in Section |2] 

The first reverse engineering problem in stringology was stated and solved 
in |FLR+99lFGL+nn| . where an algorithm was described to compute a lexico¬ 
graphically least regular string whose border array was a given integer array — 
or to return the result that no such regular string exists. Many other similar con¬ 
structions have since been published, related to other stringological data struc¬ 
tures but always specific to regular strings (see |BIST03IDLL05IFS06IMNRR,13| , 
among others). |NRR12j was the first paper to consider the more general prob¬ 
lem of inferring an indeterminate string from a given data structure (specifically, 
border array, suffix array and LCP array). Although solving such problems does 
not yield immediate applications, nevertheless solutions provide a deeper un¬ 
derstanding of the combinatorial many-many relationship between strings and 
the various data structures developed from them (see for example |MSM99j . 
where canonical strings corresponding to given border arrays are identified and 
efficiently generated for use as test data). 

For prefix tables and regular strings, the reverse engineering problem was 
solved in [CCR09) , where a linear-time algorithm was described to return a lexi¬ 
cographically least regular string x whose prefix table is the given feasible array 
y, or an error message if no such x exists. A recent paper |BSBW1^ sketches 
two algorithms to compute an indeterminate string a; on a minimum alphabet 
(not necessarily lexicographically least) corresponding to a given feasible array 
y, but the algorithms are theoretical in nature: one requires the determination of 
the chromatic number of a certain graph, an NP-hard problem, while the other 
depends on somehow identifying the minimum “induced positive edge cover” of 
a graph. However, |BSBWI4] proves an important result that we use below to 
bound the complexity of our algorithm: that the minimum alphabet size cr of a 
string corresponding to a given feasible array of length n is at most n + ^/n. In 
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this paper we use graph-theoretic methods developed from |CR,SW14] to com¬ 
pute a lexicographically least string, regular or not, corresponding to the given 
y, in time 0 {ari^). 

Section of this paper provides background material for an understanding of 
our algorithm; Section [^presents the algorithm itself; Section [^briefly discusses 
these results and suggests future work. 

2 Preliminaries 

Following [CRSWM], for a given feasible array y = y[l..n], we define a corre¬ 
sponding graph V = Vy, on which our algorithm will be based: 

Definition 4. LetV = {V, E) he a labelled graph with vertex setV = {1, 2,..., n} 
consisting of positions in a given feasible array y. In V we define, for i G 2..n, 
two kinds of edge (compare Lemma^: 

(a) for every h G l..y[i\, {h,i + h—l) is called a positive edge; 

(b) (l-\-y{i\,i-\-y{i\) is called a negative edge, provided i+y[i] < n. 

E'^ and E~ denote the sets of positive and negative edges, respectively. We write 
E = E~^ U E~, = (V, E'^), 'P~ = (V, E~), and we call V the prefix graph 

Vy of y. If X is a string having y as its prefix table, then we also refer to V as 
the prefix graph Vx of x. 

Observe that E~^ and E~ are necessarily disjoint. Figures [T||^ show the prefix 
graphs for the example strings (|^ and Q. 

1 2345678 

Xs = {a, b} {a, c} c {a, b} b c {a, c} b (3) 

773 = 8 2 0 1 4 0 1 1 

1 2 3 45678 

X 4 = {a, b} {a, c} {a, d} {c, e} a {b, e} c d (4) 

774 =8 2 4 01300 

The following lemma will be useful for the analysis of our algorithm: 

Lemma 3 ( |CRSW14| L Let Vy = (y, E) be a prefix graph of a feasible array 
y. Then y is regular if and only if every edge of Py joins two vertices in distinct 
connected components of Py . 

So as to discuss the lexicographical ordering of strings on an ordered alphabet 
E, we need first of all a definition of the order of two letters: 

Definition 5. Suppose two letters A and pt are given, where 
A = {Ai, A 2 ,..., Aj}, p = {pi,p 2 , ■ ■ ■, Pk}, 

with Xh G E for every h G l..j, ph G E for every h G l..k. We assume without 
loss of generality that j < k, also that Xh < Xh+i for every h G l..j —1 and 
Ph < Ph+i for every h G \..k — l. Then X = p if and only if Xh = Ph for every 
h G l..k and j = k; while X < p if and only if 
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(a) Xh = Ph for every h G l..j < k; or 

(b) Xh = ph for every h&l..h' < j and Xh'+i < Ph'+i- 

Otherwise, /i ^ A. 

Note that {X = pt) ^ {X ~ fj,), but that A « /x implies neither equality nor 
an ordering of A and p. We remark also that the definition of letter order 
given here is not the only possible or useful one. For example, it would re¬ 
quire {a,b,w,x,y, z} -< {a,c}, thus arguably not placing sufficient emphasis on 
economy of letter selection in the alphabet. 

Definition 6. Now suppose that two strings Xi = a;i[l..ni] and X 2 = X 2 [I..n‘ 2 \ 
on S are given, where without loss of generality we assume that ni <n 2 - Then 
if ond only if Xi[h] = X 2 [h] for every h G l..n 2 and ni = 7 x 2 ; while 
^ X 2 if and only if 

(a) = X 2 [h] for every h G l..ni < 7 x 2 ; or 
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(b) Xx[h\ = X 2 ,[h] for every h € l..h' < ni and Xx[h’-\-l\ -< x^lh’-\-V\. 
Otherwise, X 2 ^ x^. 

To better illustrate the relation of strings defined and used in this paper we 
present the following examples: 

- xi= {a,c} {g,t} a^X 2 = {a,c} {g,t} {a,g} 

- xi = a {g, t} {a, c} {a, c,g} <X 2 = a {g, t} {a, t} a 

- xi= a {a, c, g} {a, c, g} {a, t} ^ X 2 = {a, c} g g {a, t} 

where a < c < g < t. 

3 Algorithm RevEng 

3.1 The Algorithm 

The basic strategy of Algorithm RevEng, that constructs a lexicographically 
least string x (initially empty) corresponding to a given feasible array y = 
y[l..n\, is expressed by the main steps given below. Initially the alphabet E is 
empty (<7 = 0), as are the sets x[i], 1 <i <n. 

(51) Consider the edges (i, j) of in increasing order of ni+j in order to add 
a single letter to x[i\, x[j], or both based on the following steps; 

(52) if, by virtue of previous assignments, x[i\ « x\j\ (so neither is empty), 
there is nothing to do — (*> j) can be skipped; 

(53) otherwise, for the current (i,j), determine a sequence 

C = (Ai, ii), (A 2 , * 2)5 ■ ■ •, (Ar, ir) 

of all candidate assignments, where for every h € l..r, ih = i (respectively, 
j) if Xh G x[j] (respectively, x[i]), and Ai < A 2 < • • • < A^; 

(54) for the current h, determine whether or not the assignment 

x[ih] ^ x[ih] U {A/i} 

is “allowable” (that is, compatible with the neighbourhood of ih in E~) — if 
so, then perform the assignment, maintaining the elements of x[iy\ in their 
natural order; 

(55) if for no h is the assignment allowable, then assign a least new letter (drawn 
WLOG from the set of positive integers) to both x[i] and x\j\, 

(56) since it may be that after Steps (S1)-(S5) have been executed for every 
{i,j) S there still remain unassigned positions in x (that is, correspond¬ 
ing to isolated vertices in V^), a final assignment of a least possible letter 
for these positions is required (see function LEAST(i, A^ax) and Lemma|^. 

In order to implement this algorithm, several data structures need to be 
created, maintained, and accessed: 
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(DSl) The edges of E'^ are made accessible in increasing order for Step (SI) by 
a radix sort of the positive edges (i,j) into a linked list L+ (i.e., the linked 
list L'^ contains the edges of E~^ in increasing order), whose entries occur in 
increasing order of i and, within each i, in increasing order of j. The time 
requirement is 6>(|i?“''|), thus 0 {n?) in the worst case, since E'^ can contain 
0(n2) edges )(:RSW14| . 

(DS2) In order to implement Step (S4) of Algorithm RevEng, we need, for each 
position i G l..n, to have available a linked list of positions j such that (i,j) 
is an edge of E~. This can be done by using E~ to form a set of negative 
edges that includes each (i,j) twice, both as (i,j) and as (j,*). Then in 
a preprocessing step the entries in this set are radix sorted into an array 
N~ = 7V“[l..n] of n linked lists, such that for every i G l..n, -/V“[z] gives in 
increasing order all the vertices j for which (i, J) G E~ (in other words, the 
neighbourhood of i in E~ ). Since E~ contains 0{n) edges |CRSW14] . this 
preprocessing step can be accomplished in 0 {n) time. 

(DS3) Steps (S2)-(S5) require that for each i G l..n, a linked list x[i\ be main¬ 
tained of letters A that have so far been assigned to x\i\. Each list is main¬ 
tained in increasing letter order, so that update, intersection, and union each 
require 0{a) time, where a = |A7| is the (current) alphabet size. Since for 
regular strings each x[i\ has exactly one element, in this case processing time 
reduces to 0(1). 

(DS4) In Step (S4), in order to determine whether a proposed assignment of 
a letter to a position in x is allowable or not, we form a “forbidden” 
matrix E[l..n, l..cr] in which E[i, A] = 1 if and only if A G x[i] is forbidden. 
F is updated and used as follows: 

— for each new letter Amax introduced in Step (S5), F\i, Amax] is initialized 
to zero for all i G l..n; 

— whenever an assignment x[i] ^ A is made in Steps (S4) & (S5), set 

F[j, A] •(— 1 for every j G (procedure UPDATE_F(i, A)). 

Figures and 1^ give pseudocode for Algorithm RevEng and function LEAST, 
respectively. 

3.2 Correctness 

Consider first the main while loop of Algorithm ASSIGN, in which the edges of 
E+ are considered in strict increasing (f,j) order. We see that new letters Amax 
are first introduced at the leftmost possible positions in x. Thereafter, whenever 
a letter is reused (Amax not increased), it is always the minimum possible letter 
consistent with the least possible currently unfilled positions (*, j) that is used — 
by virtue of the fact that the entries in C are maintained in increasing order of A. 
Thus any automorphism of the alphabet E other than the identity would yield 
a larger string. We conclude that within the main while loop the assignments 
maintain lexicographical order ^ as defined in Section]^ 

It may happen, however, that certain positions i in a; remain empty, those 
corresponding to isolated vertices i in . Assignments to these positions are 
handled by the final for loop, which we now consider. 
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procedure RevEng {V,x,n) 

Amax ^ 0; X ^ 0"; F[l..n, l..a] ^ 0”^ 
while top(E+) 7 ^ 0 do 

{i,j) ^ pop(L+); (7^0 > i < j; ni+j a minimum 

if x[i\ n x[j] = 0 then 

VA G x[i\ do Cl ^ (A, j) t> ordered by A 

VA € x[j] do C 2 ^ (A,z) t> ordered by A 

> Merge Ci and C 2 into a single sequence ordered by A. 

C ^ MERGE(Ci,C 2 ) 

SET false 

while top(C') 7 ^ 0 and not SET do 
(X,h) ^ pop(C') 
if E[h, A] 7 ^ 1 then 

x[h] X t> maintain X ordering 

SET ^ true; UPDATE_F(/i, A) 
if not SET then 
Amax ^ Aniax“t“l 

for h ^ 1 to n do F[h, Amax] ^ 0 
x[ij i Amaxi UPDATE_F(z, Amax) 

x[j] ^ -^max! UPDATE_F(j, Amax) 

for z ^ 1 to n do 
if x[i] =0 then 

> Identify the least letter X that does not occur 

> in any x\j] for which j G -/V~[*]- 

A ^ LEAST(i, Amax)i Amax ^ inax(A, Amax) 
x[i] ^ X 

Fig. 5. Given the preprocessing outlined in (DS1)-(DS2), Algorithm RevEng computes 
a:[l..n], the lexicographically least string corresponding to a given prefix (feasible) graph 
V on n vertices. 


Lemma 4. A vertex i G l..n is isolated in V~^ if and only if 

(a) y[i] =0 or z = 1 ; and 

(b) for every j G 2..n, y[j] < i; and 
(e) for every j G l..z—1, j+y[j] < i- 


Proof. First suppose that z is isolated. Then (a) must hold; otherwise, for z > 1 
and y[i] > 0, there exists an edge (l,z) G E+, a contradiction. If (b) does not 
hold, there exists j > I such that y[j] = r > i, implying x[l..r] « x[j..j+r—l], 
hence x[i\ « x[j+z—1], so that (z, j+z—1) G E+, again a contradiction. Similarly, 
if (c) does not hold, there exists j G l..z—1 such that y[j] = r with j + r > i. 
Consequently, x[j..j+r—V\ ss x[l..r] implying x[i] « x[z—j+1], so that (z—j+1, z) G 
a contradiction that establishes sufficiency. 
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function LEAST(i, Amax) 

5[1.-Amax] ^ 

Vj G -/V“[z] do 
VA G x[j] do 
B[X] ^ 1 

A^ 1 

while A < Aniax and _B[A] = 1 do 
A ^ A+1 

Fig. 6. Identify the least letter A that does not occur in any x[j] for which j G 

Suppose then that conditions (a)-(c) all hold. If we assume that I = 1 in (a), 
then (b) implies that for every j G 2..n, y[j] = 0, so that = 0 and so every 
position i is isolated. Otherwise, if y[i] = 0 for some t > 1, conditions (b) and 
(c) ensure that position i is not contained in any matching range within x and 
is therefore isolated in 7^+, as required. □ 

Since in the main while loop new maximum letters Amax are introduced into 
pairs i and j > I of positions in x that are determined by entries in y, it is 
an immediate consequence of Lemma that i must be less than any isolated 
vertex, in particular the smallest one, imin) say. In other words, every letter 
assigned during the execution of the main while loop occurs at least once to 
the left of imin in a;. It follows that lexicographical order will be maintained if 
any required additional letters Amax+1, Aniax+2, • • • are assigned to an ascending 
sequence of positions in x determined by the isolated vertices in . We have 

Lemma 5. Given a prefix graph Vy corresponding to a feasible array y, Al¬ 
gorithm RevEng constructs a lexicographically least indeterminate string on a 
minimum alphabet whose prefix table n — y. 

3.3 Asymptotic Complexity 

The main while loop in Algorithm RevEng will be executed exactly |if+| times. 
Within the loop the construction of C requires time proportional to \C\ = |£c[i]|+ 
|a:[j]|, thus 0{a) in the worst case. The processing of C then also requires 0{a) 
time in the worst case, except for the time required for UPDATE_F. Each of the 
three calls of UPDATE_F corresponds to the assignment of a letter A to a vertex 
i of 'P~ and the ensuing update of F[z,A], an event that can occur at most tr 
times for each edge in E~, thus at most (cr x times overall. Similarly, the 

for loop that initializes the F array requires 0{n) time for each of at most tr 
values of Amax- 

We conclude that the worst-case time requirement for the while loop is 
0(crmax(|if+|, |if“|)). As illustrated by the examples y = and y = n\n— 

1|...|1, the bounds on these quantities are as follows: 0 < \E~^\ < ( 2 ) and 
0 < \E~\ < n — 1, while > n — 1. As noted earlier, it was shown in 

[BSBW14] that a < n-F-y/n, with a further conjecture that a <n. 
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Turning our attention to the terminating for loop of Algorithm ASSIGN, 
we observe that for at most n executions of function LEAST, the binary array B 
of length at most a must be created, thus overall consuming 0{an) time. The 
nested V loops in LEAST set positions in i? A times for at most every edge in E ~, 
again requiring at most 0{an) time over all invocations of LEAST. Thus 

Lemma 6. Algorithm RevEng requires 0{an^) time in the worst ease, where 
a < n + y/n. 

3.4 Example 

Suppose y = 50210, so that A+ = 13,14, 24 and E~ = 12,15, 25,35. 

— In A+ first consider edge 13, leading to assignments x[l\ •(— a, a;[3] ^ a, 
with F[2,a] = E[5,a] = 1 since both 12 and 15 are edges of E~. 

— Edge 14 of E'^ leads to a;[4] ^ a and no new values in F, since vertex 4 is 
isolated in E~. 

~ Edge 24 of A+ requires a new letter because E[2, a] = 1. Therefore we assign 
a;[2] ^ b and a;[4] ^ b, while setting F[l,b] = A[5, 6] = 1 because of the 
edges 21 and 25 in E~. 

— Finally we deal with the isolated vertex 5 in E+ by setting a: [5] ^ c since 
15 S E~ and a;[l] = a, while 25 S E~ and a;[2] = b. 

The lexicographically least string is a; = aba{a, b}c. 

3.5 Computational Experiments 

To get an idea of how the algorithm behaves in practice, we have implemented 
Algorithm RevEng and conducted a simple experimental study. A set of 1000 
feasible arrays having lengths 10, 20,.., 100 has been randomly generated as fol¬ 
lows. For each feasible array y we randomly select a value for y[i],i G [l..n] 
from within the range [0..n — i + 1], The experiments have been run on a Win¬ 
dows Server 2008 R2 64-bit Operating System, with Intel(R) Core(TM) i7 2600 
processor @ 3.40GHz having an installed memory (RAM) of 8.00 GB. We have 
implemented Algorithm RevEng in C# language using Visual Studio 2010. As 
is evident from Figure the experiments suggest that average case time also 
increases by a factor somewhat greater than n^. 

4 Discussion 

The high worst-case time complexity of the algorithm described here suggests 
room for improvement. On the other hand, it is difficult to imagine an algorithm 
that could do the same computation without considering all the edges of E+ and 
thus necessitating 0(n^) time for many instances of the prefix table tt. Similarly, 
the requirement to achieve a lexicographically least solution leads to a recurring 
dependence on alphabet size a that expresses itself in the time complexity. Even 
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Fig. 7. Timing results for randomly-generated feasible arrays y. 


though it may be true that cr < n, nevertheless it seems clear that a can be much 
larger than in the regular case, where it has been shown |CCR09ICRSW14] that 
a < |■log 2 n]. 

We have tried approaches that focus on E~ rather than E'^ as the primary 
data structure, but without success. In particular, we have considered “triangles” 
where both and ( 12 , j) are edges in E'^ , while (ii,i 2 ) € E~ , a 

situation that forces a string to be indeterminate. It turns out, however, that 
the number of such triangles is O(n^). Similarly, the ingenious graph proposed in 
[BSBW14] , whose chromatic number is the minimum alphabet size cr of a string 
corresponding to a given prefix table, has O(n^) vertices in the worst case. 

At the same time, we have no proof that our algorithm is asymptotically 
optimal; for example, an algorithm that could eliminate the a factor in the com¬ 
plexity would be of considerable interest. Also interesting would be an algorithm 
for indeterminate strings that would execute in 0{n) time on regular strings as 
a special case, thus matching the algorithm of |CCR09) . More generally, we pro¬ 
pose the study of indeterminate strings (“strings” as we have called them here), 
their associated data structures (such as the prefix table), and their applica¬ 
tions as a promising research area in both combinatorics on words and string 
algorithms. 
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